*very high BCV implies too much noise within conditions. Likely caused by tissue and genetic heterogeneity. A typical common dispersion for well controlled human data is around .4 and this data is around 1.

FIGURE 1: An Interactive PCA Analysis of the chordoma (red) and notochord (blue) samples provided by http://xavierlab2.mgh.harvard.edu/

Transcriptome libraries were aligned to the Genome Reference Consortium Human Build 38 (GRCh38) reference genome before trimming low count reads and normalizing to library size in edgeR. (The plot can be manually rotated to change the perspective)

The expression profiles appear heterogeneous with no clear distinction between conditions. In my opinion, this is why it will be critical to have access to donor metrics and also to minimize variables such as sample prep or tissue heterogeneity in the samples. Alternatively, we could use a much larger set of samples.







FIGURE 2: Boxplot of CHI3L1 expression in chordoma and notochord samples

CHI3L1 expression levels are very low in both chordoma and notochord samples with no significant difference in expression between the two tissue types.






FIGURE 3: An interactive heatmap of all differentially expressed genes between the chordoma and notochord samples

All genes that met the Benjamin-Hochberg false discovery rate cutoff of 0.05 are plotted. (Hovering over a cell will give gene information).

Again, the samples appear largely heterogeneous with many of the lacking a consistent expression profile between conditions.